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ABSTRACT 

The  design  of  a  decision  analysis  is  itself  a  complex  decision  prob¬ 
lem.  In  theory,  each  aspect  of  analysis,  encoding  the  probability  density 
functions  of  state  variables,  encoding  the  von  Neuman-Mo’-genstern  utility 
function,  and  computing  profit  lotteries  is  an  experiment.  The  results 
of  the  experiments,  the  data,  are  used  to  update  the  probabilities  in  the 
primary  decision  problem.  The  economic  value  of  the  experiment  is  the 
well-known  value  of  imperfect  information. 

The  drawback  to  the  theoretical  approach  is  that  the  data  are  func¬ 
tions.  Practical  methods  for  encoding  prior  distributions  over  functions 
do  not  exist.  Therefore,  the  traditional  approach  is  to  parameterize 
the  data. 

Our  approach  is  unique  because  we  show  that  for  an  interesting  class 
of  decision  problems,  arbitrary  parameterization  is  not  necessary.  The 
value  of  any  data  depends  probabilistically  only  on  the  prior  covariances 
of  the  posterior  means.  For  independent  state  variables  this  quantity 
reduces  to  an  estimate  of  how  much  the  mean  of  a  probability  density 
function  will  shift  during  an  experiment. 


THE  VALUE  OF  DATA  FOR  A  QUADRATIC  DECISION  PROBLEM 


Approximate  value  of  information  calculations  are  essential  in  per' 
forming  large  decision  analyses.  Based  on  a  preliminary  analysis,  the 
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analyst  must  decide  how  to  allocate  his  resources.  His  options  include 
traditional  experiments,  such  as  market  surveys  and  pilot  plants,  as 
well  as  additional  analysis.  The  encoding  of  probability  distributions 
and  risk  preference  functions  along  with  decision  trees  and  Monte  Carlo 
simulation  are  included  in  analysis. 

All  of  these  activities  are  directed  towards  updating  our  estimates 
of  the  outcome  or  value.  Their  worth  depends  on  the  prior  assessment 
of  how  much  they  might  change  our  decision  and  the  subsequent  gain  in 
expected  value.  However,  for  complex  decision  problems  finding  the  ex¬ 
act  value  of  data  requires  an  excessive  amount  of  encoding  and  computa¬ 
tion.  The  analyst  needs  to  be  able  to  make  rapid,  approximate  value  of 
information  calculations. 

This  paper  addresses  the  class  of  decision  problems  where  the  value 
function  is  approximately  quadratic  in  both  decision  and  state  variibles. 
The  main  result  is  that  the  value  of  an  experiment  depends  only  on  the 
prior  variance  of  the  posterior  mean.  This  is  a  tremendous  simplifica¬ 
tion  over  the  general  case  where  the  value  of  information  depends  on  the 
prior  probability  distribution  of  the  posterior  probability  distribution. 

After  we  have  proved  the  main  result,  we  examine  special  cases  to 
show  that  this  is  not  a  new  idea  but  a  generalization  of  an  old  one.  The 
special  cases  and  the  discussion  also  identify  the  data  required  to 
operationalize  the  theorem. 

1 .  Preliminaries 

In  this  section  we  introduce  inferential  notation  and  the  general 
terminology  required  to  describe  the  decision  problem. 

Notation 

Inferential  notation  explicitly  conditions  all  probabilities  on  a 
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state  of  information.  The  probability  density  function  of  a  random  var¬ 
iable  x  conditioned  on  the  state  of  information  g  is  denoted  by 


(1.1) 


[x|g]  . 


We  use 


of  x  is 


as  a  generalized  summation  operator;  thus  the  moment 


(1.2) 


ock|g>=  f  Xk  {x  |g] 
Jx 


whether  x  is  continuous  or  discrete.  Inferential  notation  can  be 
nested.  For  example, 


(1.3) 


{<x|s2>|g1} 


implies  that  the  mean  of  {x  | S2 )  is  a  random  variable  given  only  gj^  . 
In  addition  to  inferential  notation,  we  use  the  following  matrix 


symbols: 


a  or  a .  ] 

—  i 

A  or  [a.  .  ] 
ij 


a'  or  A' 

<a|g>  or  <at|g> 


The  underscored  lower  case  letter 

denotes  a  column  vector  with  element 

a.  . 
i 

The  underscored  capital  letter  de¬ 
notes  a  square  matrix  with  element 
a .  .  . 

The  prime  denotes  transposition  . 

A  probabilistic  operation  is  applied 
to  each  component  of  a  vector. 


The  Basic  Decision  Problem 

The  deterministic  model  illustrated  in  Fig.  1  relates  the  three  ele¬ 
ments  of  the  basic  decision  problem.  The  decision  variables  _d  are  set 
by  the  decision  maker.  The  state  variables  _s  are  set  by  nature.  The 
value  v  is  the  output  measure  that  we  want  to  maximize.  If  both  _s  and 
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(a)  THE  DETERMINISTIC  MODEL 


(b)  THE  PROBABILITY  TREE 


FIGURE  1  DESCRIPTION  OF  THE  BASIC  DECISION  PROBLEM 


^  ^j-0  known j  wg  dcnotG  the  decision  theit  insxiinizGS  ttiG  VciluB  function 

d+(s) 


(1.4) 


d+(s) 


max 

d 


v(s  ,d) 


However,  in  the  basic  decision  problem  illustrated  in  Fig.  1(b), 
d  must  be  set  before  _s  is  observed.  The  possible  outcomes  are  de¬ 
scribed  by  the  probability  density  function  [s|£]  »  where  £  is  the 
state  of  information  that  represents  the  decision  maker's  prior  knowledge 
and  experience. 

We  assume  that  js  independent  of  _d  in  the  sense  that 

(1.5)  Ls.!i>£]  =  lsl£}  • 


This  assumption  is  not  restrictive.  When  the  state  variables  are  depen¬ 
dent  on  the  decision  variables  the  problem  can  normally  be  reformulated 
so  that  the  dependence  appears  in  the  value  function. 

The  basic  decision  problem  under  uncertainty  is  to  maximize  the  ex¬ 
pectation  of  v  : 

(1.6)  max  [  v(s,d)  [s)£] 

d  •'<= 


The  expansion  rule  from  elementary  probability  theory  is 


(1.7) 


<x|£>  =  J  <x|y,£>  [y |£} 


Using  this  rule,  we  can  show  that  the  inferential  symbol  for  the  expecta¬ 
tion  in  (1.6)  is  <v[d,£>  : 


-5- 


(1.8) 


<vjd,£>  =  j  <v|_s,_d,£>  [sj£] 


The  expectations  in  (1.6)  and  (1.8)  are  the  same  since  the  expected 
value  of  v  ,  given  _s  and  d  is  deterministically  v(_s,_d)  . 

We  define  d+(£)  as  the  decision  vector  that  maximizes  the  expected 
value  of  v  : 


(1.9) 


_d+(£)  =  max  ^<V  [d,£> 
d  ' 


If  s  represents  some  possible  future  state  of  information,  we  define 
d*(S)  as  the  intent  to  use  d+(g)  when  g  becomes  available. 

The  Value  of  Information 

Suppose  that  an  analysis  or  experiment  will  provide  some  data  D 
Then  (D,£)  represents  an  improved  state  of  information.  We  define 
the  expected  value  of  the  data  <v^|£>  : 


(1.10) 


<vd|£>  =  <vjd*(D,£),£>  -  <v[d*(£),£> 


Since  £  is  our  prior  information,  d  (£)  is  known  and  thus 


(1.11) 


<v  j_d*(£)  ,£>  =  <v[d+(£),£>  . 


The  first  term  in  (1.10)  is  the  key  to  the  value  of  data.  Given  the 
data  D  we  would  find 


(1.12) 


d+(D,£)  =  max  <v[d,D,£>  , 
d 


which  would  result  in  the  posterior  expected  value  <vjd  (D,£),D,£>  . 
However,  before  D  is  revealed  we  must  compute  the  prior  expectation 
of  this  quantity  : 

(1.13)  <vjd*(D,e),D,£>  =  <<v|d+(D,£),D,e>|£> 
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v(s,d)  «  a  +  b's  +  C'd  . 

+^s'E5  +  s'Qd  +  $d'Hd 


d 


NOTATION 

'  Transpose  of  a  matrix 

a  Constant 

b,t  Constant  vectors 

s  State  variable  vector 

d  Decision  variable  vector 

E,G,H  Constant  square  matrices 


FIGURE  2 


THE  QUADRATIC  VALUE  MODEL 


2. 


The  Value  of  Data  for  a  Decision  Problem  with  a  Quadratic  Value 


Function 

In  this  section  we  find  the  expected  value  of  data  for  the  model 
illustrated  in  Fig.  2.  The  value  function  v(s,d)  is  quadratic  in  the 
state  vector  _s  and  the  decision  vector  _d  .  The  state  variables  have 
zero  mean,  and  the  decision  variables  are  zero  at  the  deterministic 
maximum  : 

(2.1)  <s | G>  *  0 


(2.2)  d*  ■  max  ^  v(<s  jS>,_d)  =  0 

These  assumptions  reduce  algebraic  complexity  without  sacrificing  gener¬ 
ality. 

We  write  the  quadratic  value  function  as 

(2.3)  v  (s  >iP  “*  a  ^ '_s  +  ^ *_d  +  _s * G  jd  +  _d 1 H  _d  . 


The  second-order  necessary  and  sufficient  conditions  for  v(s,dl)  to 
have  a  maximum  at  <s|S>  and  d*  are  that  the  gradient  of  v  with 
respect  to  d  VV ( | S.>»_dQ )  be  zero  and  that  the  Hessian  V  v(<s|C>,_d) 
be  negative  definite.  Using  (2.1)  and  (2.2)  the  gradient  and  Hessian 

at  <s  |£>  and  d+  are  defined  as  : 

— 1  — o 


av<o,o) 

(2.4)  7v(<s  je>,do)  =  — rj - 

°°i 

I _ 


(2.5) 


2  +  ,-d  v(0,0) 

7  v<^'e>'V  -  hd-^j 
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Applying  (2.4)  and  (2.5)  to  the  definition  of  v(s,d)  (2.3),  we  have  : 


(2.6)  W(0,0)  =  c ' 

(2.7)  V2v(0,0)=H 

Since  the  gradient  in  (2.6)  must  be  the  zero  vector,  our  assumptions  im¬ 
ply  that  c  must  also  be  the  zero  vector.  From  (2.7)  we  see  that  the 
Hessian  does  not  vary  with  s  and  d  for  the  quadratic.  Therefore  if 

the  detenr  inis  tic  optimum  d  exists,  H  is  negative  definite  and  the 

_0  — 

value  function  has  a  global  maximum  with  respect  to  d  for  any  state 
vector  j;  . 

Chronologically,  we  receive  the  data  about  the  state  variables. 

Then  we  set  the  decision  vector,  and  finally  nature  sets  the  state  vari¬ 
ables.  The  state  variables  are  independent  of  the  decision  variables  but 
not  necessarily  independent  of  each  other.  We  assume  that  the  decision 
maker  is  risk- indifferent  so  that  maximizing  the  value  function  is  equiva¬ 
lent  to  maximizing  the  decision  maker's  von  Neumann-Morgenstern  utility 
function . 

With  these  preliminatires  we  can  state  the  theorem: 

THEOREM:  For  the  quadratic  value  function 

(2.8)  v(s,d)  =  a  +  b's  +  s'E  _s  +  s'G  d  +  d'H  d 

where  the  Hessian  H  is  negative  definite,  the  value  of  any  data  D 
is 

(2.9)  <vD|e>  =  -  <C<S  |D,e>'  G  H_1  G'  <s|D,C>|e>  . 
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PROOF:  From  (1.10)  the  value  of  the  data  is 


(2.10)  <vD|£>  =  <v|d*(D,£)  ,D,£>  -  <v|d*(£),£>  . 

The  proof  is  in  two  parts  corresponding  to  the  two  terms  of  (2.10). 

First  we  determine  the  prior  maximum  expected  value  <v  |_d* (£)  , £>  ; 
then  we  determine  the  expected  value  given  the  opportunity  to  maximize 
after  the  data  is  received  <v  |d*(D ,£) ,D ,£>  . 

To  find  <v[d*(£),£>  we  start  with  the  prior  expected  value 
<v [d ,£>  .  Recalling  that  the  expected  value  of  the  state  variables  are 
all  zero,  the  prior  expectation  of  (2.8)  is 

(2. 11)  <v  [d  ,£>  =  a  +  <s 'E  _s  |£>  4-  d'H  d  . 

The  first-order  necessary  condition  for  <v|d  (£),£>  to  be  an  uncon¬ 
strained  maximum  is  that  the  gradient  be  zero  at  d+(£)  : 

(2.12)  V  <v|d+ (£),£>-  O' 

Taking  the  gradient  of  (2.11)  and  setting  it  to  zero,  we  have 

(2.13)  d+'(£)H«=0'. 

Since  H  is  negative  definite,  d+(£)  must  be  the  zero  vector.  There¬ 
fore  (2.11)  becomes 

(2.14)  <v  |d+  (£),£>=  a  +  <s'E  s  |£>  . 

Returning  to  the  first  term  in  (2.10),  the  expected  value  given 
data  D  is 
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Maximizing  (2.15)  with  respect  to  _d  we  have 

(2.16)  V  <v  jd+(D,e)  ,D,e>  =  <s  |E ,£>’  G  +  d'H  =  O'  . 

Equation  (2.16)  implies  that 

(2.17)  d+(D,G)  =  -  H_1  G'  <s|D,e>  . 

Substituting  (2.17)  into  (2.15),  we  have  : 

<V  |d+(D,£)  ,D,£>  =  a  +  _b'  <s|D,e>  +  ^  <s 'E  _s  |D,£> 

-  <s  |D,e>'G  h‘LG»  <s|D,£>  +  -|  <s|D,£>'  G  h"LG'  <s|D,£> 
=  a  +  b'  <s  jD,£>  +  <s  ' E  _s  |  D ,  £> 

(2.18)  +\  <s|D,e>'  G  H'  LG '  <s  |D  ,£> 

Recalling  (1.13),  the  next  step  is  to  take  the  prior  expectation 
of  (2.18).  V.e  shall  consider  each  term  separately.  Of  course,  expec¬ 
tation  does  not  affect  the  value  of  the  constant  a  .  The  prior  expecta¬ 
tion  of  the  posterior  mean  is  the  prior  mean  : 

(2.19)  <<s|D,e>|e>  =  <s|e> 

Equation  (2.19)  is  a  direct  application  of  the  definition  cf  conditional 
probability.  Likewise,  the  third  term  becomes 

<<s»e  s]D,e>|e>  -  <s ' e  _s  |e>  . 


(2.20) 


Applying  these  results  to  (2.18),  we  have 


<V  |d*(D  ,e)  ,e>  =  a  +  -|  <s'E  s  |e> 

(2.21)  -  ~  <<s|D,e>'  g  h_1g'  <s|D,e>je>  . 

Finally,  subtracting  (2.14)  from  (2.21)  the  rec.ult  is 

(2.22)  <vD|C>=-^  C<s|D,£>'  G  H_1G'  <s|D,£>|e>  . 

Q.E.D. 

Special  Cases  of  the  Theorem  that  Appear  in  the  Literature 

Three  special  cases  of  the  theorem  (2.9)  appear  in  the  literature. 
Howard  [2,  p.  518]  treats  the  case  where  H  is  diagonal  and  the  data  D 
is  clairvoyance.  DeGroot  [1,  p.  234]  solves  for  ^  ,  the  estimate  of 
the  random  variable  _s  which  minimizes  a  quadratic  loss  function.  In 
our  notation  his  problem  is  the  case  where 

(2.23)  E  *  -  H  ; 

a  ,  b  ,  and  G  are  zero;  and  D  is  c lairvouance .  Raiffa  and  Schlaifer 
[4,  p.  188]  present  the  one- dimensional  estimation  problem  without  re¬ 
quiring  the  data  to  be  clairvoyance. 

3.  Discussion  of  the  Value  of  Data  for  the  Quadratic  Problem 
An  alternate  expression  for  the  theorem  (2.9)  is  : 

(3.1)  <vnle>  =  -  \  trace  E  C 

D '  2  —co  —  D 

where 

(3.2)  E  =  G  H_1  G' 

—CO - — 
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(3.3) 


"d 


[<<s.|d,S>  <  s.|D,e>|p  > 


The  trace  of  a  matrix  is  the  sum  of  its  diagonal  elements.  The  value 
of  data  has  two  major  components.  The  basic  decision  problem  is  speci¬ 
fied  by  E  ,  and  the  experiment  is  described  by  C  .  We  consider 

“CO 

and  Cp  briefly  for  the  general  case.  Then  for  the  case  that  is  most 
common  we  discuss  how  E  an<^  — d  cou^  be  generated. 

E  follows  directly  from  (3.2)  for  a  true  quadratic  value  func- 

— co 

tion  since  the  matrices  G  and  H  are  specified.  For  a  problem  that  is 
only  approximately  quadratic,  G  and  H  can  be  found  by  expanding 
v(s,d)  in  a  Taylor  series  about  the  point  (<s |e>,d+(C) )  =  (0,0)  : 


v(s,d) 


v(0,0) 


(3.4) 


The  partial  derivatives  are  all  evaluated  at  the  point  (0,0)  .  Com¬ 
paring  (3.4)  with  (2.8),  we  see  that  G  and  H  must  be  matrices  of 
partial  derivatives  : 


(3.5) 


(3.6) 


The  partial  derivatives  at  the  operating  ooint  (0,_0)  can  be  approxi¬ 
mated  from  open  loop  sensitivities.  One  joint  sensitivity  is  required 
for  each  possible  pair  of  state  and  decision  variables  and  for  each  pos¬ 
sible  pair  of  decision  variables. 
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The  elements  of  the  matrix  are  the  expected  product  of  the 

posterior  means.  Since  the  prior  expectation  of  the  posterior  mean  is 
zero,  the  elements  are  the  covariances  of  the  posterior  means. 

When  the  data  is  clairvoyance  on  the  state  variables  _s  ,  (3.1)  reduces 
to 

(3.7)  <vc  |C>  -  -  \  trace  Eco  C  . 

If  we  consider  the  posterior  means  <s|D,£>  as  random  variables,  com¬ 
parison  of  (3.7)  and  (3.1)  implies  that  the  value  of  data  is  the  value 
of  clairvoyance  on  the  posterior  means.  In  most  practical  problems  the 
value  of  clairvoyance  on  the  posterior  mean  is  much  easier  to  compute 
than  the  value  of  clairvoyance  on  the  data  itself. 

An  Interesting  Special  Case 

The  most  interesting  special  case  occurs  when  either  Ecq  or  Cp 
is  a  diagonal  matrix.  Then  the  value  of  data  becomes 

(3.8)  <vD|£->  -  z  £!  h"1  £.  V<<si  |D,£>|e> 

where  the  vector  is  the  i1-*1  row  of  G  : 

(3.9) 


If  the  state  variables  are  independent  (3.8)  is  exactly  equal  to 

(3.1) .  Sufficient  conditions  for  (3.7)  to  be  a  good  approximation  to 

(3.1)  ar-j  that  the  diagonal  elements  dominate  the  off-diagonal  elements 

of  E  :  that  is,  for  each  i  and  i  : 

—co 


(3.10) 


2  (si  if1  sp  u; jrLs;> 

(Si  if1  ij)2 


p. .  « 
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where 


is  the  correlation  coefficient 


Given  G  ,  H  ana  Cp  ,  these  expressions  tell  us  when  the  diagonal 
assumption  holds.  A  more  interesting  question  is  whether  we  can  avoid 
generating  the  entire  matrices  G  ,  H  and  CD  .  The  answer  rs  yes 


as  shown  below. 


Description  of  the  Primary  Problem  Using  Closed  Loop  Sensitivities 


We  now  show  that  the  term  H  is  the  second  partial  denv- 

tVi 

ative  of  compensation  with  respect  to  the  i  state  variable  : 


(3.12) 


-1  a''co(si) 

1 


where 

(3.13)  vco(si)  =  Vc(si)  "  Vo(s’’) 

The  open  loop  sensitivity  is  evaluated  by  varying  s^  while  the 
other  state  variables  and  the  decision  variables  remain  constant.  We 
denote  the  open  loop  sensitivity  as 

+. 

(3.14)  v0(Si^  =  v(0,0, . . . ,si . °>d0'  ■ 

In  closed  loop  sensitivity  the  state  variables  other  than  s£  remain 
fixed,  but  the  decision  is  reoptimized  for  each  si  : 

(3.15)  v  (s.)  -  vi'0,0, . .  .  ,s  , . . .  ,0,d+(0,0, . . .  ,s  , .  . .  ,0)) 

Cl  1 

To  show  that  expression  (3.12)  is  valid  we  evaluate  v0(s^)  and 
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v  (s.)  for  the  quadratic  value  function  (2.8)  : 
c  1 


(3.16) 


1  2 

v(s.)=a  +  b.s.+-e..  s. 
o  i  i  l  2  ii  i 


v 

c 


(si} 


max(a  +  b . s .  +  — 
d  i  i  2 


2 

e .  .  s  . 
ii  i 


+ 


^ +  i 


d  H  d) 


(3.17) 


=  a  +  b .  s  . 

i  i 


2 

e .  .  s  . 
ii  i 


2 
s . 
l 


Subtracting  (3.16)  from  (3.17)  the  compensation  is 


(3.18) 


v  (s.)  = 
CO  1 


- 1  if  fit 


Therefore,  by  evaluating  the  compensation  curves  for  the  state  vari¬ 
ables,  the  need  to  find  the  matrices  of  partial  derivatives  G  and  H 
is  eliminated. 

The  Description  of  the  Data  Generating  Process  through  Preposterior 
Moments 

The  second  component  of  (3.8)  is  <C<s.|D,£>|^  the  prior  variance 
of  the  posterior  mean.  To  evaluate  this  term  we  use  the  theorem: 

(3.i9)  ^s|e>  =  ^<3  jD,e>je>  +  <%>|d,£>|£> 

A  proof  of  this  theorem  is  given  in  Raiffa  and  Schlaifer  [4,  p.  106]. 

The  theorem  states  that  the  prior  variance  <s|£>  has  two  sources.  The 
expected  posterior  variance  ^ <  s|D,£>|£^  is  a  residual  variance  which 
will  not  be  resolved  by  the  experiment  that  generates  the  data  D  . 

The  prior  variance  of  the  posterior  mean  <£<s |D,£>|£^  is  the  portion 
of  the  prior  variance  that  will  be  resolved  uy  the  experiment. 

Sample  Data 

Expression  (3.19)  is  best  known  for  the  case  where  data  are  N  random 
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samples  from  js|£j  .  First  we  consider  the  limiting  cases  of  no  samples 
and  of  infinite  samples.  Then  we  consider  a  finite  number  of  samples. 

When  the  data  is  the  null  experiment,  N  =  0  ,  the  prior  and  pos¬ 
terior  states  of  information  coincide.  Therefore,  we  have 

(3.20)  O'-s  |D,e|-e>  =  |e>|e >  =  y-s |c  ■ 

(3.21)  ^sjD,e  |e>  =  v<s|e-|e>  =  °  • 

When  the  number  of  samples  approaches  infinity,  the  data  is  clair¬ 
voyance  about  s  .  The  posterior  probability  density  function  will 
have  all  of  its  mass  at  a  single  point.  Consequently,  the  preposterior 
moments  are 


(3.22) 

<Js|D,e>|C>  =  ■'0  |£' 

(3.23) 

<;'siD,£>|e>  =  v's]e 

To  discuss  (3.19)  for  finite  N  it  is  convenient  to  define  the 
ratio  r 

(3.24)  r  =  V<^s|D, £>)£>/  <s|S>  . 

The  limiting  cases  are  r  =  0  for  the  null  experiment  and  r  =  1  for 
clairvoyance . 

A  Bayesian  must  assign  both  r  and  fs|£|  before  he  con  calculate 
the  expected  value  of  sample  information.  For  example,  Raiffa  and 
bchlaifer  [4,  p.  110]  suggest  assigning  an  equivalent  sample  size  N' 
to  the  term  V<<s  |D ,  •  Then  for  certain  conditions  the  parameter 

r  is 
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(3.25) 


N 

N'  +  N  ’ 

Assigning  other  r  or  N'  weights  the  prior  information  relative  to 
the  sample  information. 

Experiments  That  Do  Not  Involve  Sampling 

Encoding  and  modeling  are  analogous  to  sampling  because  they  par- 
£j_a]_ly  resolve  uncertainty  about  the  state  variable  s  .  Encoding  the 
parameter  r  or  equivalently  <(<s  |D ,£>j£^>  should  be  no  moie  aiffi- 
cult  for  these  cases  than  for  sampling. 

4 .  Conclusion 

Application  of  the  approximate  value  of  information  is  a  two-step 
process.  First,  the  value  function  must  be  approximated  by  a  Taylor 
series.  This  step  is  routine  given  deterministic  sensitivity  data. 

The  second  step  is  to  encode  the  prior  covariances  of  the  posterior 
means  of  the  state  variables.  Practical  applications  aru  the  result  of 
careful  modeling  so  that  the  encoding  problem  is  tractable. 
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